This page last changed on Jun 29, 2009 by gobbert.

Jacob Kogan, Department of Mathematics and Statistics

Clustering is used in a number of traditionally distant fields to describe methods for grouping of unlabeled data. Clustering very large datasets is a contemporary data mining challenge. This project concerns an application of Principal Direction Divisive Partitioning clustering algorithm (PDDP) introduced by D. Boley to a dataset residing in a number of computers connected in a network. Performance of PDDP and Distributed PDDP for datasets of moderate size will be compared.

Document generated by Confluence on Mar 31, 2011 15:37